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Unlike standard quantum mechanics, dynamical reduction models assign no particular a 
priori status to "measurement processes" , "apparata" , and "observables" , nor self-adjoint 
operators and positive operator valued measures enter the postulates defining these models, 
fn this paper, we show why and how the Hilbert-space operator formalism, which standard 
quantum mechanics postulates, can be derived from the fundamental evolution equation of 
dynamical reduction models. Far from having any special ontological meaning, we show 
that within the dynamical reduction context the operator formalism is just a compact and 
convenient way to express the statistical properties of the outcomes of experiments. 



I. INTRODUCTION 

Dynamical Reduction Models (DRMs) provide, at the non relativistic level at least, a coherent 
unified description of both microscopic quantum and macroscopic classical phenomena, and in 
particular give a consistent solution to the macro-objectification problem of quantum mechan- 
ics . They are defined by the following set of axioms: 



Axiom A: states. A Hilbert space Ti is associated to any physical system and the state of the 



'Electronic address: bassi@ts .infn.it I 
t Electronic address: ghirard i@ts.infn. it | 
"''Electronic address: lsalvetti@ts.inf'n.Tt1 



2 



system is represented by a (normalized) vector \tp) in TC. 

Axiom B: evolution (GRW model). At random times, distributed like a Poissonian process 
with mean frequency A, each particle of a system of particles is subjected to a spontaneous 
localization process of the form: 



1^*^ ^ iiL„(x)i^,)ir "^"^"^ = V h 2 - ' 

where q„, is the position operator associated to the ?i-th particle, and IV't) is the wave function of 
the global system immediately prior to the collapse; the collapse processes for different particles 
are independent. Between two collapses, the wave function evolves according to the standard 
Schrodinger equation. The probability density for a collapse for the n-th particle to occur around 
the point x of space is: 



p(x) = ||L„(x)|V',)f. (2) 



The standard numerical values [l1 for the two parameters A and a are: A ~ 10^^^ sec^^ and 
a ~ lO^'^ cm~^. A continuous formulation in terms of stochastic differential equations is also 
commonly used 

Axiom C: ontology. Let V(xi,X2, . . .x^v) = (xi, X2, . . . XjviV') the wave function for a system 
of N particles (which for simplicity we take to be scalar) in configuration space. Then 

N 

m{y.,t) = / d^xi . . . d^XAr J^^^ (x„ - x)|V'(xi, X2, . . . XAr)p (3) 

n=l 

is assumed to describe the density of mass^ distribution of the system-^ in three dimensional space, 
as a function of time 

aa. 

As we see, within DRMs concepts like measurement, apparata, observables play no particular 
privileged role; like in classical mechanics, they merely refer to particular physical situations where 
a macroscopic physical system, which we call apparatus, interacts in a specific way with another 
physical system, e.g. a microscopic quantum system. However, such a macroscopic system, the 



^ In the subsequent sections, for simplicity's sake, we will not make reference to the mass density function anymore, 
and we will only keep track of the evolution of the wave function; however it should be clear that, in order to 
be fully rigorous, all statements about the properties of physical systems should be phrased in terms of their 
mass-density distribution. 

^ The mass density in principle refers to the whole universe; however, as standard practice in Physics, one can make 
the approximation of considering only a part of it, which is sufficiently well isolated, and of ignoring the state of 
the rest of the universe. 
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apparatus, is ultimately described in terms of its fundamental constituents, and its interaction 
with other systems is ultimately described in terms of the fundamental interactions of nature. The 
basic idea should be clear: all physical processes are governed by the universal dynamics embodied 
in the precise axioms we have just presented. What is usually denoted as a "measurement" of an 
observable by an apparatus is simply a precise physical process which is purposely caused by a 
human being under controlled conditions. In what follows we will use, to denote such a situation, 
the term "experiment" to conform to the clear-cut position of J.S. Bell, summarized in the following 
lucid sentence js]: 

I am convinced that the word 'measurement' has now been so abused that the field 
would be significantly advanced by banning its use altogether, in favor, for example, 
of the word 'experiment'. 

Given this, the following question arises: why are experiments on microscopic quantum systems 
so efficiently described in terms of average values of self-adjoint operators, and more generally in 
terms of POVMs? Why is the Hilbert space formalism so powerful in accounting for the observable 
properties of microscopic systems? The aim of this paper is to provide an answer to these ques- 
tions, from the point of view of DRMs. We will show that, within DRMs, one can derive a well 
defined role for self-adjoint operators and POVMs as useful (but not compelling) mathematical 
tools which allow to compactly express the statistical properties of microscopic systems subject 
to experiments. Accordingly, within DRMs, recovering the formal aspects of standard quantum 
mechanics is simply a matter of practical convenience and mathematical elegance. Stated in dif- 
ferent terms, while experiments on quantum systems are nothing more than a particular type of 
interaction between a macroscopic (thus always well localized) system and a microscopic one, such 
that different macroscopic configurations of the macro-object correspond to different outcomes of 
the experiment, it is nevertheless simpler to refer to the statistical properties of the outcomes in 
terms of self-adjoint operators averaged over the initial state of the microscopic quantum system. 
Just a matter of practical convenience, nothing more. 

The paper is organized as follows. In Sec. [TI] we show with a simple example how DRMs 
recover the operator formalism for the description of the statistical properties of the outcomes 
of quantum experiments. Sec. IIIII is the core of the paper: we will prove in full generality how 
DRMs allow to associate a POVM to an experiment, i.e. how the statistical properties of the 
experiment can be represented as the average values of the effects of the POVM over the state 
before the "measurement" of the microscopic quantum system. In Sec. IIVI we will show that when 
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an experiment is reproducible, the POVM reduces to a PVM and the experiment can be represented 
by a unique self-adjoint operator, as it is typically assumed in standard text books on quantum 
mechanics. In Sec. |V]we reconsider the so-called "tail" problem and show that it does not affect 
the dynamical reduction program. In Sec. IVII we show with an explicit example that the Hilbert 
space formalism can be used also to describe certain classical experiments. Sec. IVIll contains some 
concluding remarks. 

This paper takes most inspiration from Ref. [9], where the emergence of the operator formalism 
within the framework of Bohmian Mechanics has been thoroughly analyzed. 

II. EMERGENCE OF THE OPERATOR FORMALISM: A SIMPLE EXAMPLE 

We begin our analysis by discussing a simple physical situation which should make clear how, 
and in which sense, the operator formalism of standard quantum mechanics "emerges" in a natural 
way from the physical and mathematical properties of DRMs, even if it does not appear explicitly 
in the axioms defining these models. 



For simplicity's sake, let us consider a spin-1 particle (its Hilbert space being Ti = C^) and let 
us assume that it has been initially prepared in the normalized state 



where \Sz = +1), \Sz = 0), and \Sz = —I) are the three eigenstates^ of Sz, and a, b, and c are 
three complex parameters which can be varied according to the preparation procedure. Let us now 
perform a Stern-Gerlach type of experiment which measures the spin of the particle along the x 
direction. According to the rules of DRMs^ (no other assumption is used, other than axioms A-C, 
in the simplified form suitable for this example), we can state that: 

• Throughout the entire process, the measuring device has always a well defined macroscopic 
configuration. In the particular example we intend to discuss, there are three possible out- 
comes for the experiment, i.e. three possible final macroscopically different configurations of 
the measuring device, which we call e.g. "outcome +1", "outcome 0" and "outcome —1"; 



A. 



A measurement situation 




(4) 



^ There is of course nothing special in the choice of the eigenstates of S^: we could have very well chosen another 

basis of C'^. 
* See [lol ] for a details analysis of this topic. 
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• The outcome is random and the probabihty distribution depends only on the initial state of 
the micro-system, according to a law which, with great accuracy, is equal to: 

P'^(+l) = i|a + V26 + cp, (5) 
pV'(o) = i|a-cp, (6) 
P'^(-l) = ^\a-V2b + c\^. (7) 

It is worthwhile stressing that the above probabilities, which ultimately coincide with the stan- 
dard quantum probabilities associated to the outcomes of a measurement of the spin along the x 
direction, are not postulated, but derive from the dynamics of DRMs, when applied to the specific 
measurement-like situation which has been chosen. Given these premises, we now show how one 
can associate an operator, namely the spin operator Sx, to this specific experiment. 



B. Observables as operators 



Let O = { —1, 0, +1 } be the set of the possible outcomes of the experiment, and let Q be the 
power set oi O {Q = V{0))^ which is an algebra on O. We can then define a probability measure 
on Q in the following obvious way: 



vy eg. 



(8) 



with ¥^{v) given by Eqs. (m)-©. The above definition assumes that is a fixed unit vector, while 
V can vary. Let us now reverse the roles of and V: we fix an element V £ Q and consider ¥^{y) 
as a function of It is easy to recognize that the probability distribution depends quadratically 
on the initial state {tp) (i.e. on the coefficients a, b, c) of the micro-system: as a matter of fact, this 
is the very reason why one can attach an operator to the experiment. Such a property is even more 
evident if we introduce the following normalized and orthogonal states: 



= +1) 
\v = 0) 

= -1) 



1 

2 

1 

1 



\S, = +1)+V2\S, = 0) + \S, = -1) 
[\S, = +1)-\S, = -1)], 

\S, = +1) -V2\S, = 0) + \S, = -1) 



resorting to which we can express ([8]) in the following compact way: 



(9) 
(10) 

(11) 
(12) 
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If we allow \tjj) to run over the entire Hilbert space Ti, not just over the unit sphere^ S^, then 
F'^{V) becomes a quadratic function from 7i to M, being the diagonal part of the sesquilinear form 

pV':</>(y) = 'il:,(f,en, Ffixed. (13) 

Given the bounded sesquilinear form ¥^''^(V), the Riesz representation theorem allows us to express 
it in terms of a bounded linear operator Oy, in the following way: 

]pV,</'(y) = {'iIj\Ov\(I>), (obviously, in this simple case: Oy = ^|f)(v|). (14) 

In this particular example we know also that the operator Oy is self-adjoint. 
Going back to the original P*^, we can then write: 

F^{V) = {i;\Ov\i^), Vy eg, V IV) (15) 

and the set of operators {Oy }veg forms a POVM, as one can easily prove. This is the result we 
wanted to arrive at: thanks to the particular dependence of the probability measure P*^ on tp and 
to the Riesz representation theorem we have been able to express the statistical properties of the 
outcomes of the experiment in terms of average values of the effects of a POVM over the initial 
state of the microscopic system. 

For our particular experiment, the set { Oy }ygg is more than a POVM: each of the eight 
operators Oy is in fact a projection operator, and therefore { Oy }ygg turns out to be a Projection 
Valued Measure (PVM), which is the one associated to the spectral resolution of a self-adjoint 
operator 



/o 1 o\ 

O = Y.v\v){v\ ^ O = - 



V2 



(16) 



1 1 

yo 1 oy 

For this reason, we can rightly associate the operator O to our experiment, in the precise sense 
given here. By inspecting the components of the three vectors \v) given by Eqs. ([9]), (jlOp . and (jlip 
and the explicit form of the operator O displayed in Eq. (|16p . we can now finally recognize that O 
is indeed the component Sx of the spin along the x direction for a spin 1 particle, written in the 
basis \Sz = +1) , \Sz = 0) , \Sz = —1) of the eigenstates of Sz- 



® Of course, F'^CV) can be consistently interpreted as a probability only when \-il>) G <S^. 
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III. THE OPERATOR FORMALISM WITHIN DYNAMICAL REDUCTION MODELS 

In this section, we prove in full generality what we have shown with the previous example, 
namely how the Hilbert-space operator formalism derives from DRMs as a tool to express the 
statistical properties of the outcomes of the experiments. 

A. The link between experimental outcomes and macroscopic positions 

When describing physical experiments, one usually identifies experimental outcomes with real 
numbers; however, what one actually sees (his empirical experience) as the outcome of a measure- 
ment is not a real number, but a specific configuration of a macroscopic object, namely a pointer 
being located in a well-defined region of space®. It is then necessary to set a link between real num- 
bers interpreted as the outcome of an experiment and the position of the pointer of the measuring 
apparatus. We now wish to make precise the conditions which the state of the pointer satisfies 
whenever we experience a perception which we interpret as: "The outcome of the experiment is f", 
V being a real number. 

1. The position of a macroscopic object 

For simplicity's sake, we will analyze the pointer by considering only the spatial degrees of 
freedom of its center of mass, ignoring its spatial extension and orientation, as well as all its 
microscopic degrees of freedom. According to the ontology of DRMs, a pointer is a distribution 
of mass which — being macroscopic — is localized within a small region of space coinciding with the 
spatial extension of the pointer itself, but nevertheless it has small "tails" spreading out to infinity; 
because of this, it is not possible to adopt the point of view according to which the pointer (and, 
in general, any macroscopic object) is located in some given region of space if and only if its mass 
density is entirely contained within that region; we will come back to this point in Sec. El We then 
say that a macro-object is located within a given region of space when almost all its mass density 
distribution is contained within that region. 

As a consequence of the dynamical laws of DRMs, in all standard physical situations'' such 

^ One might also consider different situations, like e.g. the firing of a counter; however, what matters is that in all 
measurement situations the final states of the apparatus differs for a macroscopic mass density distribution. The 
reader will have no difficulty in transcribing the following analysis in such a way that it applies also to the just 
mentioned situations in which there is not any pointer. 

^ Here we do not take into account those pathological situations (whose probability of occurring is vanishingly small) 



8 



as measurement processes, the region around which the wave function of the center of mass of a 
macro-object is localized is extremely small; accordingly, one can also take as the position of a 
macro-object the center of that region, which can be mathematically expressed by the formula 

= ii^tHA) e M^ (17) 

where q is the position operator of the center of mass of A. We stress that should not be 
interpreted as the quantum average of the position operator q, as done in standard quantum 
mechanics, but as the coordinates of the point in space around which, at time t, the mass density 
is appreciably different from zero. In the following, we will use (|17p to denote the position of a 
macro-object. 

Coming back to our pointer, let us assume that the pointer moves only along the graduated 
scale, so that we can treat it as a one dimensional system: we will call G M its position along the 
scale. Since its dynamical evolution is intrinsically stochastic, qt is a random variable : ^ M for 
any time t, where is the sample space of the probability space {^},J-,¥) on which the stochastic 
dynamics is defined; as a matter of fact, at the end of a measurement process we know that the 
pointer is located somewhere along the scale, but we do not know exactly where. Accordingly, the 
physically relevant information is embodied in the probability distribution of qt- 

F,AB] ^ nQ^HB)] B G ^(M), (18) 

which gives the probability that qt lies within the (measurable) subset B of the Borel <7-algebra 
^(M) of M. 

2. The calibration of an experiment 

As we mentioned at the beginning of this section, we usually do not speak of the outcome of an 
experiment in terms of "the pointer being in a particular position in space", but rather in terms 
of "the pointer signaling a particular outcome", typically a real number; this sort of association 
between the spatial position of the pointer and the (numerical) outcome of the experiment 
is of course entirely a matter of convention. E.g., in a Stern-Gerlach type of experiment we 
usually associate the upper part of the screen with the outcome +1 and the lower part with 
the outcome —1, but we do not directly observe the number +1 or —1: what we observe are 
spots either in the upper or in the lower part of the plate or, if we want to keep referring 



in which a macroscopic object can be, for a very short time, in a superpositions of macroscopically different states. 
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to a movable pointer, what we observe is the pointer sitting in two macroscopically different 
positions along the scale, corresponding to the two possible outcomes. The set of outcomes 
|— is conventional, and we are free to use whatever set suits best the interpretation of 
the experiment. In accordance with Ref. [9|], in what follows we will refer to the association 
between directly observed positions of the pointer and conventionally decided outcomes as to the 
calibration of the experiment. In all generality, we define the calibration of an experiment as follows. 



Definition 2: Calibration function. Let O be the set of possible outcomes of an experiment. 
The function 




from the space M of the positions of (the center of mass of) the pointer to the space O of the 
possible outcomes of the experiment, is called the calibration function of the experiment. For 
mathematical convenience, we assume / to be ;S(M)/^-measurable, where ^ is a chosen cr-algebra 
on O. 



By means of the calibration function, we can replace the probability distribution P^j of Eq. (jlSp 
which gives the probability that the pointer lies within i? G M, with the probability distribution 

^fMiv] ^ nq^HrHv))] V eg, (20) 

which represents the probability that the outcome of the experiment belongs to the measurable 
subset V of O, the set of all possible outcomes. 



B. The link between experimental outcomes and microscopic states 

In this section we establish the main result of the paper, namely the link between the prob- 
abilities associated to the possible outcomes of an experiment and the pre-measurement states 
of the microscopic system whose properties are measured, showing how such probabilities can be 
represented by the mean value of the effects of a POVM over the state of the microscopic system 
right before the measurement. 
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1. The characteristic traits of measurement-like situations 



We start our analysis by summarizing the most relevant properties of measurement processes, 
as they are described within DRMs; for a detailed analysis of these properties, we refer the reader 
to Ref. 0. 

Let Xn be the n-th position of the pointer along the graduate scale corresponding to the outcome 
Vn G O, where O is the set of all possible outcomes, to which a cr-algebra Q is associated. Let ip 
be the time at which the experiment ends. According to the dynamical laws typical of all DRMs, 
we can state that: 

Property 1. Throughout the whole measurement process, and in particular at its end, the center 
of mass of the pointer is always very well localized in space; making reference to the model 
analyzed in Ref. [l^, the spread in position of the center-of-mass wave function of a pointer 
having the mass of 1 g is about aq ~ 10~^^ m. 

Property 2. Let us consider an interval (5„, centered around the position x^, whose extension d we 
take of the order of 10"^ cm; then, the probability rj that, at the end of the measurement 
process, qt^ does not lie inside any of the sub-intervals 5n is very small, e.g. rj ~ 10^^^. This 
means that, with probability extremely close to 1, the pointer ends up in one of the positions 
along the graduate scale corresponding to one of the possible outcomes. 

Since the wave function of the center of mass of the pointer has a spatial extension (though 
very small), the event qt^ S (5„ does not imply that most of the center-of-mass wave function lies 
almost entirely within 5„: this happens e.g. when qt^ lies at the border of In order to take this 
possibility properly into account, let us consider a new set of intervals A„ containing the intervals 
5n, each of which is centered around x„, whose extension is equal io d+i where H is large compared 
to the typical spread cjq of the center of mass of the pointer, e.g. I ~ 10^ x cTg ~ 10~^ m (see Fig. [1]). 
We can then say that when qt^ lies within (5„, practically all the center-of-mass wave function lies 
within A„, while the probability for gtp lying in A„ — (5„, i.e. outside 5^, is, according to property 
2, vanishingly small. 

Up to now we have spoken only of the position of the pointer, but as we have said it is custom to 
refer the probabilities to the (numerical) outcomes of the measurement. To this end we introduce 
the following calibration function: 





(21) 
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FIG. 1: The picture shows the points x„ associated to the different outcomes of an experiment, together 
with the intervals bn and A„ used for the proof of Theorem 1. 



which simply means that the outcome of the experiment is Vn whenever the pointer, at the end of the 
measurement process, lies around the position x„ of the graduate scale. In the above definition, we 
have chosen A^, as the relevant intervals in order to make the proof of the following theorem simpler: 
however, on a macroscopic scale the two sets of intervals ({(^n} and {A„}) are practically identical. 
Note that, since we want different outcomes to correspond to different macroscopic configurations 
of the pointer, the intervals A„ should not overlap, which means that the distance between two 
consecutive points x„ and along the graduate scale should be bigger that d + £ ~ 10^^ m; 

this is of course a perfectly reasonable assumption. 

We are now in the position to prove the following theorem, which is the cornerstone for the 
subsequent derivation of the Hilbert space formalism from the framework of DRMs. 



Theorem 1. Let us consider an experiment globally described by the stave vector let qt be 
the position of the (center of mass of the) pointer at time t right before the measurement begins, 



and let tp be the time at which the experiment ends. Then: 



< 2 



(22) 



V "V^ € g : 

where the probability measure is defined as follows: 

and is the projection operator on the (measurable) subset Ay = /~^(V^), which is the subset 
of M of all positions of the pointer corresponding to a precise outcome v among the possible 
outcomes belonging to V . 



(23) 



The above theorem is very powerful: according to the very general structures of DRMs, the prob- 
ability distribution IP'/(qtp) of the outcomes of an experiment can, for all practical purposes, be 
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replaced by the probability measure Qtp which has a the very simple mathematical expression 
given by Eq. ()23p . This is due to Eq. (I22|) and to the fact that, according to the typical numerical 
values for A, t], aq and £ given before, one has: 



+ r] 



2 X lo-^^ 



(24) 



a vanishingly small value. We point out that Eq. (|23p should not be confused with the quantum 
average value of the projection operator Pb relative to the position of the pointer, as given by 
standard quantum mechanics. As a matter of fact, since an experiment always involves a macro- 
scopic system, the one described by iV'i), a state vector which includes both the measured system 
and the measuring apparatus entails a dynamical evolution which is entirely different from that 
predicted by the Schrodinger equation, in particular it always keeps the measuring device in a state 
well localized in space. We now prove the theorem. 



dF - [ iijtAPAyl^ptp) d¥ 
Jn 



< 



{I - {iJtAPAv\i^tp))dF + 



{iJtAPR-Av\i^tp)dF + 



iiJtAPAvli^tp) dF 
{i^tp\PAv\i^tp)dF, (25) 



where is the sample space on which the stochastic dynamics is defined. Let us define 6v = ^n^n 
with 5n £ Ay; the first of the two terms in the last line can be re- written as follows: 



/ {'^tAPR-Av\i^tp)dF = / {'^tAPu-Av\^tp)dF + / i^tplPR-Avli'tp) 



(26) 



according to the second property of DRMs listed above, the probability measure of the set Ay — 5v 
is smaller than ry; thus, taking also into account that (V'ip l-^K-Av IV'tp) ^ 1; we can write: 



/ 



{il^tAPR-Ay\i^tp) dF < rj. 



(27) 



'Qt^i^v-Sv) 

Regarding the first term on the right-hand-side of Eq. (j26p . we now show that the integrand 
{4't-p\P'M.-Av\4't-p) is extremely small for any cj G such that qt^ lies within Sy- Defining il){x) = 
(xjV'tp), we have of course 

{^l^tp\PR-Ay\^tp) = f d{q} f dx\^tA{q},x)\\ (28) 
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where {q} denote all degrees of freedom involved in the measurement, except the one for the 
center of mass of the pointer. Whenever qtp belongs to 5v, we have that the parameter i defining 
essentially the extension of the intervals A„, satisfies i'^ < {x — qt^)"^ for all x S M — Ay; we get 
then the following inequality: 

e [ d{q} [ dx|V't,(M,x)|2 < / d{q} [ dx(x-gtjV*p(W,^)l' 

< / d{q} [ dx(x-(7tp)Vtp(W,2;)P = al (29) 

J JR 

Eqs. dSlD, ([27]) and ^ lead to the following result: 

/, {^tAP^-Ay\^t^)d^ < (30) 
4";(Av) ^ ^ ^ 

A completely symmetric argument holds for the second and last terms at the right-hand-side of 
equation (j25p as well, hence the theorem is proven. 

To summarize, we have shown that in measurement-like situations it is fully legitimate, as a 
consequence of the reducing dynamics of DRM, to use the probability measure in place of Pj(gj) 
to compute the probabilities of the possible outcomes of an experiment. 



2. The emergence of the Hilbert-space operator formalism 

Let us now focus our attention on the probability measure Qj which, according to the defini- 
tion (I23p . can be re- written also as: 

QtiV] = Tv[pt{I{,}^PAy)] V e g, (31) 

where Ijg} is the identity operator acting on the space of all degrees of freedom of the experiment, 
except the one referring to the center of mass of the pointer. The above expression is a consequence 
of the well-known formula typical of DRMs : 

Ep[(V^t|0|Vt)] = Tr[Opt] with: pt = E^Ut){M, (32) 

where O is any suitable operator. We remind the reader that the state vector \ipt) (thus also pt) 
refers both to the (whole) state of the measuring device as well as to the state of the microscopic 
system. Regarding the initial state \ipto) (or, equivalently, pt^), right before the experiment begins, 
we make the following assumption: 
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Assumption 1. At the beginning of the experiment {t = to), the state of the micro-system and that 
of the apparatus are f actor ized: 

Pto = Pi®pt (33) 

where = I represents the initial state of the microscopic system, which we assume 

to be pure, while represents the initial state of the apparatus. 

This initial factorization of the two states is a very natural assumption to make, since otherwise 
the microscopic system would not be in any defined state, whose property the experiment should 
detect. Moreover, we recall the following important property characterizing dynamical reduction 
models: 

Property 3. The dynamical evolution S^j^^p) mapping the density matrix pt^ describing the global 
state prior to the experiment, into the final state pt^ after the experiment, is of the quantum 
dynamical semigroup type, thus, in particular, linear and trace-preserving. 

According to the above two assumptions, we can write Qt, computed at time t = tp, as follows: 

QtAV] ^ TV [^toMiPto ® Pt)hi} ® ^A^] ^ e ^' (34) 

for any fixed V e Q. Assuming now that |V'^) can run over the entire Hilbert space H^, the above 
expression defines the diagonal part of the following bounded sesquilinear form: 

n^^n^ — ^ C 

(35) 

which, according to the Riesz representation theorem, can be written as follows: 

QtAV] = (V'folOyl^fo) = T^iPtoOv], (36) 

where Oy is a bounded linear operator in . In our case, Oy turns out to be also self-adjoint 
and defines a POVM from the measurable space of the possible outcomes {0,Q) to the Hilbert 
space of the micro-system. This is the desired result, which we formalize in the following 
theorem: 

Theorem 2. According to the properties of DRMs stated before (properties 1-3), according to 
assumption 1 and, within the limits set by Theorem 1, one can write: 

^f{<itjV] ^ (^folOyl^fo) VFeg. (37) 
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In other words, the probabihty that the outcome of a given experiment belongs to the measurable 
subset V of the set O of possible outcomes can be (with very high accuracy) expressed as the aver- 
age value of the effect Oy, V & Q, of a POVM, over the initial state \ipto) of the microscopic system. 

We have thus recovered the operator formalism of standard quantum mechanics. 
It is interesting to compare Eq. (I36p with Eq. (I34jl : 

QtAV] = Tr[S(,„,,p)(pf„®p,^JIs®PAv] = Tr^Ov]; (38) 

the middle term of the above equation provides the true physical description of the experiment: it 
gives the probability for the pointer to lie within a well-defined region along the graduate scale at 
time tp; on the other hand, the last term provides the compact (and very handy) quantum way of 
expressing such probabilities in terms of the initial state of the micro-system. A couple of further 
comments are at order. 

1. Clearly, since a wave function has always a spatial extension, the width of the intervals A„ can 
not be set equal to zero. This means that, according to DRMs, only experiments having at most 
a countable number of outcomes can be performed. This of course includes all physically realiz- 
able experiments, while those having a continuous number of outcomes represent a mathematical 
idealization. 

2. Nowhere in our analysis we have explicitly used the fact that the probabilities of the possible 
outcomes of the experiment, as predicted by DRMs, practically coincides with quantum probabil- 
ities. However, such a feature of DRMs is implicitly contained in property 3, i.e. in the fact that 
the evolution law for the statistical operator is linear; the reason is the following. When a jump 
process of the form ([T|) occurs on the n-th particle of a many-particle system, a density matrix 
p = J2i Ci\tpi){tpi\ changes, in accordance to axiom B, as follows: 

where p(x) is the probability density for a jump to occur in x. As we see, the above evolution is 
not linear in p, unless we require p(x) to be of the form ([2]), which agrees with the Born probability 
rule. 

IV. REPRODUCIBILITY AND PVM 

The example we have discussed in Sec. HI] belongs to a particular subclass of experiments, 
because it can be associated to a PVM, while the general theorem of the previous section shows 
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that experiments are in general associated to POVM, which are more general than PVM. In this 
section we show that the possibility of associating a PVM to an experiment is strictly connected 
to the reproducibility of the experiment itself: since on standard books on quantum mechanics it is 
(implicitly) assumed that experiments are reproducible, the analysis shows why usually experiments 
are associated to projection operators. 

We say that an experiment is reproducible if it can be performed many times on the same 
physical system and, each time we perform two runs in a row, one right after the other, the second 
one always yields the same outcome as the first one. In order to give a more rigorous definition, 
we first have to recall another important feature of DRMs: 

Property 4- At the end of a measurement process, the final state IV'tp) of the whole system is, to 
a great accuracy, a factorized state of the system and the apparatus'^: \ipt-p) — \i^t-p) iV't^)- 
Accordingly, the vector |^^) representing the state of the microscopic system right before 
the experiment began, changes to a new (normalized) state which depends in general 

both on IV'^,) and on the stochastic dynamics of the interaction between the system S and 
the apparatus A. 

In the following, when convenient we will write IV'^/V'^) iii place of to stress the dependence 

of the final state on the initial one; moreover, we will often write IV"^^") i^i place of to signify 
that IV'^) is the state to which the systems S is reduced at the end of the first experiment in 
which we suppose that the outcome Vn has been obtained. Let iV't'"") describe the time evolution 
during the second experiment, when the initial state at time at which the second experiment 
takes place is assumed to be: = IV'^^") ^ IV'^)- A reproducible experiment is defined as 

follows. 

Definition 3: Reproducible experiment. An experiment on a physical system S is said to 
be reproducible if and only if: 

1. The experiment can be performed on S at least twice; moreover, the Hilbert space TC of 
vectors describing the possible states of S before the measurement is the same as the Hilbert 
space of vectors describing the possible states of the system after the measurement; in other 
words, the totality of the possible final states IV'^) niust span Tl: 

span{ I Vfp/O, \K) e = (40) 



See [lol] for a quantitative analysis of this feature of DRMs 
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2. Let us suppose that two experiments are done, one immediately after the other; let P^"^^ ^ ^ be 
the probability distribution of the outcomes of the second experiment, assuming that IV'^'"") 
has been taken as the initial state of the microscopic system for the second experiment, i.e. 
assuming that the outcome of the first experiment is Vn- We then require that: 

IP7(,^,)m = l, \fV:v^eV (41) 

i.e. that the outcome of the second run of the experiment belongs to V with certainty. 

Let us briefly comment on the above definition. The first request in the definition above excludes 
experiments which alter the nature of the physical system, e.g. because they destroy it, or one 
of its parts, or because they transform it in a new physical system with a space of states TL' 
different from the original space Ti. The second request just embodies the idea of reproducibility, 
by assuring that the same outcome is obtained with certainty when the experiment is performed 
twice. Now we can state the following theorem: 



Theorem 3: Reproducible experiment. Let us consider an experiment which, according to 
Theorem 2, is associated to a POVM {O^jy- If the experiment is reproducible, and within the 
limits of replacing the probability F'i'} . with the probability Q!i"[y] = Ep[(V'5-''"|Pai/|V'3'''")], 
then the POVM is a PVM. 

The proof of the theorem is given in Ref. Q]; here we propose a simplified version of it. We first of 
all notice that, given a self-adjoint positive semidefinite operator O with bounds < O < I , then 

{i^m) = '^ ^ oiV') = iV') (42) 

{i^m) = 0\i;) = \iv), (43) 

where |w) is the null vector. To show this, let us write Ti = Tii ® Tij^, where Hi is the subspace 
of all eigenstates of O corresponding to the eingenvalue 1 and 7ij^ its orthogonal complement. Let 
be a normalized vector, which we decompose in: = IV^y) + \tp±), where iV^y) G Ti-i and 
\4>±) GTi-i- It follows that: 

(V;|0|V^) = 1 => (V'x I O I V'^) = I V'^) • (44) 

On the other hand, the property < O < / implies that O acts on the subspace Tij^ like a 
contraction, i.e. that \\0 \'ip±) \\ < \\ \ip±) ||, unless \ij±) = This contraction property, together 
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with the Cauchy-Schwarz inequahty, gives 

(-i/'±|0|V'±) < {ip±\i^±) unless \'i(j±) = \uj). (45) 

Now, Eqs. (j44p and (j45p are incompatible unless \ip±) = \uj), which proves Eq. (|42p . In a similar 
way one can prove also Eq. (H3]l . 

Coming back to the probability measure Q!'," , according to the analysis of the previous section 
and to the hypotheses of our theorem, we can write, for any V <Z Q: 

Q:r[V] = {^Pf;'"\Ov\^Pt") = \ (46) 

[ else. 

Let 7iY be the subspace of all eigenstates of Oy corresponding to the eingenvalue 1, and Ti^ be 
the subspace of all eigenstates corresponding to the eingenvalue 0; according to ([Ml), and to the 
fact that spanjl^/^j^-"")} = H, we can write: H = TiY ® Ti'}^ ■ This result proves that Oy is an 
orthogonal projector on 7i, with support in TiY ■ Hence the POVM { Oy }y is a PVM. 

The ideal measurements usually considered in many QM textbook, i.e., those obeying the rule of 
the Wave-Packet Reduction (WPR) postulate, are reproducible, and therefore the POVM associ- 
ated to them reduces to a PVM. When this happens, the experiment can be entirely characterized, 
with respect to its statistical properties, by a single self-adjoint operator, as is usually done in 
quantum mechanics. 



V. THE "TAIL PROBLEM"; THE STERN-GERLACH EXPERIMENT REVISITED 

One of the reason why, with reference to theorem 1, the two probabilities P^^^ and Qtp are not 
strictly equal is that, even when qt S (5„, the wave function for the center of mass of the pointer 
has not a compact support contained in the interval A„, but it has tails spreading out to infinity; 
such tails, being extremely small (in the sense that the integral of the square modulus of the wave 
function over the region laying outside A„, assuming that its center is contained within 5n, is very 
small), give not rise to any problem in connection with the interpretation of the theory and its 
physical predictions. 

In this section we want to point out that precisely the same problem with tails occur also 
in standard quantum mechanics. Let us take as an example the Stern-Gerlach experiment: this 
is often used in textbooks as the paradigmatic experiment which illustrates the correspondence 
between observables and self-adjoint operators. What textbooks usually provide is just a simplified 
description of the truly observed experimental results; a more realistic analysis which takes into 
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account also the spatial (beside spin) degrees of freedom of the atoms sent through the apparatus 
would show that tails emerge also here, which (just in principle) give rise to some potential problems 
in interpreting the outcome of the experiment, and in associating a self-adjoint operator to it. In 
this section we perform such a kind of analysis, we discuss the role of the tails of the wave function 
of the atoms and we make precise the sense in which it is legitimate to associate the usual spin 
operator Sz to the experiment. 

As it is well known, Stern and Gerlach used an oven to produce and send a beam of silver atoms 
through an inhomogeneous magnetic field, letting it eventually impinge on a glass plate. In order 
to analyze the effect of the magnetic field on the beam, two separate experiments were originally 



111 ] performed: one with the magnet generating the field turned on, with a run time of 8 hours, 
another with the magnet turned off, with a run time of 4.5 hours. In the magnet-off case, a single 
bar of silver approximately 1.1 mm long and 0.06-0.1 mm wide was deposited on the glass plate. 
In the magnet-on case, a pair-of-lips shape appeared on the glass: the shape was 1.1 mm long, 
one lip was 0.11 mm wide, the other was 0.20 mm wide; both lips appeared defiected with respect 
to the position of the magnet-off bar, and the maximum gap between the upper and lower lips 
was approximately of the order of magnitude of the width of the lips. Stern and Gerlach made 
only visual observations through a microscope, with no statistics on the distributions: they did not 
obtain "two spots" as it is usually stated on textbooks, and though the beam was clearly split in 
two distinguishable parts, these were not disjoint. They accounted for the experiment as exhibiting 
the property of "space quantization in a magnetic field." 

Let us give a simplified mathematical description of the experiment, taking however into ac- 
count not only the spin, but also the spatial degrees of freedom of (the center of mass of) the silver 
atoms [3]. The silver atoms of a Stern-Gerlach experiment can be treated as spin one-half ele- 
mentary particles to a very high degree of accuracy; let us assume that they are initially prepared 
in an (arbitrary) spin state 

|c+,c_) = c+|+)+c_|-), (|c+|2 + |c„|2 = l), (47) 

where |-|-) and |— ) denote the usual normalized eigenstates of Sz- The wave function describing 
the center of mass of the atom at the initial time t = can be taken to be a normalized Gaussian 
wave packet 

G(x; xo, po, Ax) = (x|xo, po, Ax) (48) 
centered around the position xq, traveling along the x-axis towards the region where the magnet 
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is located, with mean value of momentum po and spatial spread Ax, extremely well localized with 
respect to the dimensions of the region where the magnetic field is different from 0. 

Let us model the interaction between the atom and the magnetic field, which we treat as a 
static external field, by the usual Hamiltonian operator Hsg = B with fx = kcr, B the magnetic 
field, <T = {crx,cry,az) the three Pauli operators, and k a known constant whose value is irrelevant 
for the following discussion. Denoting by z the unit vector along the z-axis, let us assume that 
the inhomogeneous magnetic field is^ B = (Bq — bz) z inside the spatial region where the magnet 
is located and with a negligible gradient outside it. Finally, to simplify the matter as far as 
possible, let us adopt the "impulsive measurement" assumption, which amounts to ignoring the 
free evolution of the silver atom while it is interacting with the magnetic field, and let us suppose 
that the time interval r between the emission of the silver atom from the oven and the moment 
when it impinges on the glass plate is so small that the spread Ax of its wave packet does not vary 
appreciably. 

With these premises, a simple calculation shows that the state of the silver atom at the time t = 
T, after it went through the inhomogeneous magnetic field region and just before the detection by 
the glass plate, is 

\i/jr) = c+ |x+,p+. Ax) 1+) +c_ |x_,p_. Ax) I-) , (49) 

where the wave functions G(x; x-t, p-t. Ax) = (x|x-|-, p-t. Ax) are normalized Gaussian packets of 
mean momentum p± and mean position x-t given by^'': 

P± = Po i kbr z, x± ~ xq + — r lb — z. (50) 

m m 

The detection process by means of the glass plate can be modeled by associating to the plate 
the operator Z = — Z_, acting on the spatial degrees of freedom, where Z^ and Z_ are the two 
projection operators corresponding to the localization in the upper (z > 0) or lower {z < 0) half 
parts of the plate. Recalling that we choose a normalized vector to represent the initial state, the 
corresponding probabilities according to standard quantum mechanical rules are then easily seen 

^ This assumption is inconsistent with the Maxwell equations, in particular with V x B = 0. A more realistic 
assumption would be B = ba;x+ {Bo — bz) z. A simple analysis shows, however, that this second assumption gives 
rise to correcting terms which are inessential for our present discussion. 

In deriving equation (|50p the realistic assumption rkb <C po |p±| — |po| (i.e., that the transverse variation of 
the momentum is negligible with respect to its modulus) has also been taken into account. 
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Outcome 




FIG. 2: Typical (schematic) spatial distribution of the superposition of two Gaussian wave functions in a 
Stern-Gerlach experiment. The wave function corresponding to the outcome "+" has a tail spreading in the 
region corresponding to the outcome "— " , and vice- versa. 



to be 

= |c+|^ (x+,p+, Ax|Z±|x+,p+, Ax) + |c_|^ (x_,p_, Ax|^±|x_,p_, Ax) (51) 

By looking at equation (j5ip . we immediately see that the probability to observe an outcome 
in, say, the upper part of the screen, is not |c+p, as one can find in the textbook descriptions 
of the experiment, because the two wave functions G(x; x±, p-t, Ax) are not localized sharply in 
(respectively) the upper and lower parts of the glass plate. Indeed, the tail of the "+" wave 
function G(x; x+, p+. Ax) which lies in the lower part does not contribute at all to P(c+,c_)(+l)) 
while the tail of the "— " wave function G(x; x_, p_. Ax) laying in the upper part does contribute 
(see Fig. [2]). Needless to say, this is not a disfeature due to the use of a Gaussian wave-packet: the 
whole analysis could be repeated practically unaltered for an arbitrary initial wave packet, because 
even if one starts with a wave packet with compact support, the free evolution would immediately 
and unavoidably spread it all over space. 

However, provided the initial wave packet is sufficiently well localized both in position as 
well as in momentum (within the limits allowed by the Heisenberg principle, of course), and 
the spatial separation of the two final wave packets is much greater than their spatial spreads, 
i.e. 2/c6r^/m S> Ax the "crossed" contributions of the tails, namely (x_|_, p+, Ax|^- |x_|_, p+, Ax) 
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and (x_, p_, Ax|^+|x_, p_, Ax), are small compared with the uncrossed contributions, and the 
textbook approximation P(c^,c_)(=tl) = |c±P can be meaningfully recovered. 

It is interesting to restate the previous argument in the following way, by observing that equa- 
tion (jSTj) can be rewritten in terms of the initial spin state |c+, c_) as 

P(c+,c_)(±l) = (c+,c_|0±|c+,c_) (52) 

where the two effects 0± are: 

0± = (x+,p+, Ax|^±|x+,p+, Ax) |+)(+| + (x_,p_, Ax|Z±|x_,p_, Ax) |-)(-| (53) 

acting on the spin Hilbert space C'^. We thus obtain a result we are now used to: the probability 
distribution of the outcomes has been expressed as the average value of the effects of a POVM over 
the initial state of the microscopic quantum system. However, since the wave packets are spread out 
in space, we see that the effects (f53]l are not projection operators, i.e., ^ 0±. Concluding, the 
observable associated to the Stern-Gerlach experiment is not exactly represented by the operator Sz 
(or, equivalently, by the PVM given by |+)(+| and |— )(— |): it is rather a generalized observable 
described by the POVM formed by the two effects 0±. Thus, to be rigorous, also within standard 
Quantum Mechanics the association of the operator Sz to a Stern-Gerlach experiment is just an 
approximation, whose range of validity has been made clear by the previous discussion: indeed, 
when the crossed contributions of the tails of the two Gaussian wave functions are negligible with 
respect to the uncrossed ones, we have 

0+ ~ |+)(+| and 0_ ~ |-)(-|, (54) 

so that the POVM {0+,0_} associated to the experiment approximately becomes the spectral 
PVM {|+)(+|, |-)(-|} of the spin operator Sz- 

VI. OBSERVABLES AS SELF-ADJOINT OPERATORS IN CLASSICAL MECHANICS: 
AN EXAMPLE FROM CLASSICAL ELECTROMAGNETIC THEORY. 

From the previous analysis one should have grasped that the operator formalism for describ- 
ing the outcome of experiments is not a peculiar feature of Quantum Mechanics but, according 
to the Riesz representation theorem, it can be applied whenever the states of physical systems 
are represented by vectors of a linear vector space and the outcome of an experiment depends 
"quadratically" on the initial state of the system being measured. As such, there is no reason why 
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it should not be possible to apply such a formalism also to classical systems: to illustrate this fact, 
we now give an example taken from classical electromagnetic theory. 

Let us confine our attention to a monochromatic plane wave of frequency to traveling in the 
vacuum in some given direction^^ fi (i.e., with wave vector k = wn). Such a wave is completely 
characterized by the electric field 

E(t,x) = Re(eexp[i(u;t-k-x)]); (55) 

the complex wave amplitude e expresses both the intensity as well as the polarization state of the 
wave. Accordingly, taking into account the linearity of Maxwell's equations and the fact that the 
wave intensity^^ / can be written in terms of e by means of the canonical complex scalar product, 

I = e* -e, (56) 

we see that, if we limit our attention to the degrees of freedom contained in the wave amplitude 
alone, ignoring the spatial ones, a monochromatic plane wave qualifies as a physical system whose 
states belong to the Hilbert space and on which experiments (typically detection of the intesity) 
represented by quadratic forms can be performed. 

One of the simplest examples of such a kind of experiment on a monochromatic plane wave 
can be constructed by letting the wave impinge on a linear Polaroid and by measuring the trans- 
mitted intensity. To further simplify the matter, let us restrict to the case of a linearly polarized 
monochromatic plane wave, i.e., let us assume e G R^. Experience shows that in such a situation 
the transmitted wave intensity If depends on the angle between the directions of polarization of 
the wave and the filter according to the Malus law: 

If = hcos^e, (57) 

where = e • e is of course the incident wave intensity and, if 17 is the polarization direction of the 
filter, cosO = e ■ J7/||e||. Accordingly, Malus law (j57p can be re-written as. 

If = {fi-ef, (58) 

and we easily recognize that the outcome of the experiment (the output intensity If) is a quadratic 
form of the initial state of the wave (its polarization e). Then, according to the Riesz representation 

According to the customary practice in classical theories, in this subsection we will denote vectors with the 
traditional bold notation v rather than the Dirac notation |t;). 

The wave intensity we are referring here to is of course the mean value of the Poynting vector of the wave over a 
time interval long enough with respect to the wave period. 
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theorem, we can express If in terms of a bounded linear operator O as follows (we now pass to the 
Dirac notation to highlight the conclusion): 

Ifiv) = {e\Or,\e) Or, = \rj){v\- (59) 

Of course, in the present example, the average value of the operator Of, over the initial state 
of the micro-system does not represent the average of the possible outcomes weighted with their 
probabilities, but it is simply the (deterministic) value of the output intensity. 

Let us note that this example also shows that non-commutativity, which is usually considered 
as a characteristic trait of quantum mechanics, can and does arise in classical and deterministic 
contexts as well. Indeed, by taking into account the fact that the transmitted wave is polarized 
along the direction of the filter: 

|eout) = \v) ivl^in) , (60) 
and by considering two successive arbitrarily oriented filters, we see that the transmitted intensity 

// = /i cos^ 01 cos^ (02 - ^1 ) (61) 

of the wave after the two filters depends in general on their respective order. The mathematical 
counterpart of this property is that the two operator O^^ and associated to the two experiments 
do not commute, a part from the very special orientations which correspond to |r)i) = ± |J72) or 

imlm) = 0. 

VII. CONCLUSIONS 

In this paper we have shown how the Hilbert-space operator formalism — i.e., the use of self- 
adjoint operators and, more generally, of POVM to describe experiments on quantum systems — 
can be recovered within the context of DRMs: it can be derived from the dynamical laws of 
DRMs satisfying the very general assumptions that we have analyzed in section IIIIBI as a compact 
way to express the statistical properties of the outcomes of measurement-like experiments. It is 
worthwhile stressing once again that within the context of DRMs the operator formalism has no 
special ontological meaning: as this paper shows in detail, it is merely a convenient tool that can 
be used to describe certain experiments that we are accustomed to think of as "measurements." 

This and the previous analysis prove that DRMs represent a well-grounded theory, whose ontol- 
ogy is clearly specified and consistent with our macroscopic perceptions, deprived from the typical 
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paradoxes of quantum mechanics which are connected to the special roles that the theory attributes 
to "measurement processes" and "observables" . 

We consider the results of this paper as an important step which completes, at the non- 
relativistic level, the general line of though and the world view which has inspired the dynamical 
reduction program, both at the formal and interpretational level. 
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